SAT in P 


Abstract 

There are several difficulties in finding mechanisms that allow us to solve the Boolean satisfiability problem in 
polynomial time. Despite of all, here we will present two methods, a simple one to help us understand that it is 
possible, and a more complex one for when much larger inputs are available. In both cases we will use the power of 
mathematics to solve a logical management problem. By means of linear equations, or by means of abstract algebra we 
will obtain conclusive results and demonstrate why the algorithms work without risk of backtracking. 
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1. Introduction. 

A Turing machine is a mathematical model of computation that defines an abstract machine that 
manipulates symbols on a strip of tape according to a table of rules. It is considered that in that 
machine it is possible to denote any enumerable problem, and when each of the intermediate 
states are exactly determined Alan Turing designated that as an automatic Turing Machine [1] (a- 
TM, deterministic TM), if, on the contrary, it is always possible that the next state can be a finite 
number of possible states then that machine is denoted a choice Turing Machine (c-TM, non- 
deterministic TM, NDTM). 


The Boolean satisfiability problem (sometimes called propositional satisfiability problem and 
abbreviated SATISFIABILITY or SAT) is the problem of determining if there exists an 
interpretation that satisfies a given Boolean formula. That problem is usually reduced to formulas 
which are a Boolean product of sum of literals. Where a literal is a Boolean variable negated or 
not, and a sum of literals is called a clause. 


If we write on a tape the expression of a product of clauses and ask what values the variables 
must take to satisfy the formula, 1f we had those values we would only need polynomial time to 
determine if the formula is satisfied. That is why in a NDTM it is considered to be bounded in 
polynomial time, because in the absence of determining a finite set of values proportional to the 
input it would be solvable in polynomial time. Therefore, the SAT problem is considered trivially 
within the NP class. Having a mechanism that allows us to know what values the variables should 
take in polynomial time would convert SAT to the class P. More details on the open problem in 
question can be found in [2]. 


2. Method used. 

We will use two methods to support this statement: one is simple to understand and the other 
requires some mathematical commitment. The second method has the advantage that it works 
faster the more machines are involved because it is easy to distribute in a network of computers 
working in parallel. 


2.1 Method 1 


For the first method we will have any well-formed formula and find a way to express it as a 
system of linear equations modulo 3 with only one restriction: that all variables must be non-zero. 
It is easy to find a system of inequalities modulo P, being an odd prime number, that represents a 
product of sums of Boolean literals. Therefore, since the restriction that the variables must not be 
0 is equivalent to finding an equivalent in inequation format, the simplest representation has been 
chosen. 


Let's start by looking at Table 1 with a simple equation: x + y +A +B =1 (mod 3) knowing that 
X, y, A, B cannot be 0. 
Table 1. x+y+A+B =1 (mod 3) 
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Note that there are five entries, if we consider that x mod 2 is the way we have to establish 
equivalence towards modular arithmetic, then 7x = 2x. Moreover, we observe that B = xy, except 
in the last entry; which is an interesting operator to represent. 


More specifically, B=x O y (mod 2). So 1f we study what happens with the negates of x and y 
then it will happen that B =—x O —y and we create the following table 2: 


Table 2. x+y+A+B =1 (mod 3) 
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We now observe that when we combine the equations 
x+y+A+B=1l (mod 3) 
2x+2y+C+B=1 (mod 3) 
we manage to make B=xy equivalent to x=y operation, and we also see that A corresponds to the 
AND operation and C corresponds to the NOR operation. Therefore we already have more than 
enough operators to represent any Boolean equation by means of a system of equations in base 3, 
with the restriction that 1ts variables do not cancel each other out. 





One of the operations that must be taken into account when solving our system of equations is 
when the solution is already determined, and it is not possible to find degrees of freedom. There 
are only two valid possibilities for a variable: 
x =1 (mod 3), 
x =2 (mod 3), 
and four others for two variables: 
x+y=l1 (mod 3) <=> x =2 (mod 3) % y =2 (mod 3) 
x + y =2 (mod 3) <=> x=1 (mod 3) % y=1 (mod 3) 
x+2y=1 (mod 3) <-> x =2 (mod 3) £ y=1 (mod 3) 


x +2y =2 (mod 3) <=> x=1 (mod 3) 4 y =2 (mod 3) 


There are also 4 simplifiable homogeneous equations: 

x+y+z=0 (mod 3) <> x + 2y =0 (mod 3) £« y + 2z =0 (mod 3) 
x +2y +z=0 (mod 3) <=> x + y =0 (mod 3) 4£ y +z=0 (mod 3) 
x + y +2z=0 (mod 3) <=> x + 2y =0 (mod 3) € y +z=0 (mod 3) 
x + 2y + 2z =0 (mod 3) <-> x + y =0 (mod 3) € 2y + z=0 (mod 3) 























Knowing that, depending on how you want to implement the solver, when encountering equations 
such as x + By =0 (mod 3) automatically the column y can be eliminated and having the value of 
x we can deduce: y = 2Bx mod 3. 


We will call the clarifications made on simplification and equation determination system closure: 
in the sense that they may have to be applied repeatedly until we find a system of equations with 
enough variables per equation to continue solving deterministically. 


To demonstrate that it is possible to create a solver that solves in polynomial time and space the 
SAT problem, we will begin by recommending that for each pair of literals to which we must 
perform a logical operation, we calculate the four equations for the variables i, ] with 1<]: 








Vi+ Vj¡+ Ay1i¡ + X¡ =1 (mod 3) 
Vi j 2V; Aoi 2X; =1 (mod 3) 
2Vi vi Ani 2X; =1 (mod 3) 
2V; 2V; j Aoi; + Xi =1 (mod 3) 





























Note that when forming the matrix, the columns representing V, will be in one order, the A's in 
another and the X's in another, so we will arrange the smaller index columns with the X's, then 
place the A's and finally put the V's in that order. 

Moreover, an additional condition is observed: 





Ani + Aroi + Aori + Aoo =1 (mod 3) 





We will call this submatrix of five new rows and five new columns to be added for each 
relationship i, j that is found the box(i, j). 


Thus, once the matrix is formed, if a basis is sought from the first columns, it is most likely that 
the variables V, will be left as parameters accordingly. Another aspect to take into account is 
what would happen if the matrix ends up having more rows than columns, or the same amount; in 
that case you would already have the solution. And if any of the variables had to be 0 or end up in 
an inconsistency, that means there is no solution. 


Of course, as with Gaussian triangulation, if a row is found to end in O = O then it must be 
eliminated. So we proceed to show the algorithm: 


1. Transform the Boolean formula into a circuit. | O(n?) | | 















































2. Order the columns that will be needed for the circuit. O(n) 
3. A box(i, j) is generated for each gate(i, j) of the circuit. O(m) 
4. The identity of the matrix with the first columns is calculated. O(n?) 
5. The system closure is calculated Om)  0O(n) 
6. If there are no more columns than rows: Solve and END. O(n) 
7. A supposition is launched from the last columns O(n) 
8. If the supposition is correct: Solved and END. Om) 
9. If fails: form new boxes that deny the supposition to be added. Om) 
10, Reduce the new rows of the boxes from the identity (normalise). Om) 
11, If the new rows were eliminated: Unsatisfiable and END. 1 

12. Jump to 5 O(m) 





Note that you jump to 5 from 12 as many times until the number of rows equals or exceeds the 
number of columns, knowing that a box always adds five new rows for five columns, unless the 
circuit pair 1, j has already been used before, in which case only the circuit box that has not been 


referenced is incorporated. 


That box will override the parameters on the previous basis so that next time the assumption is 
made with fewer degrees of freedom. So for each assumption the number of columns that 


generated the indeterminacy is reduced by at least one. 


Table 3. Example of how matrices are transformed. 

























































































Base A v 
1 0 0 x x x 
0 1 0 x x x 
0 0 1 x x x 
Base A v 
1 0 0 x x x 
0 1 0 x x x 
0 0 1 x x x 
0 0 0 C C 0 
0 0 0 C C 0 
Base A v 
1 0 0 0 0 x 
0 1 0 0 0 x 
0 0 1 0 0 x 
0 0 0 1 0 0 
0 0 0 0 1 0 











As you can see in Table 3, between the new columns, the new rows, the columns that are used 
and that form part of the new base, in the end the free parameters are cornered and the number of 
rows to be satisfied increases. But it can never increase beyond the number of columns that are 
Incorporated into the base. That is why the iteration is linear. 


2.2 Method 2 

It is possible to speak of a third method whose complexity is lower, provided that it takes 
advantage of having several machines running in parallel. Using matrix notation, n machines on 
an input of n clauses can distribute the work of multiplying the matrix by itself and go from 
having a cost of O(n? log n) to being O(n log n) for large inputs. 


To do this, the following rules must be available: 

1. Two clauses of size n and m generate a submatrix of 1's of nxm. 

2. Every literal 1 and j that are opposites becomes 0 in (1, j). 

3. Every literal 1 and j that are the same becomes 0 ink%X 1 (1, k) andkF] (k, j). 


In this way a matrix is formed with the submatrices relating all clauses to themselves to form a 
square matrix of size mxm where m is the sum of all literals along the input, or the product of the 


number of literals per clause times the number of clauses if it was the same. 


Starting from these elements we will define a ring using the product and sum operators as 
follows: 


Table 4 How is defined in the ring product and sum. 


AxB=A 8 (AB>-n) Product of matrices, comparation if the result if greater or equal to n 
where n is the number of clauses 

















A+B=AO0B XOR opperation element by element 
0: matrix full of 0's neutral element in the sum 
1: matrix identity neutral element in product 








To proceed to demonstrate the distributive property we will use what is demonstrated in the 
appendices, knowing that as the operator (A 8 (B € C)>=n) can be translated as: 


(Aj 8(Bi O Ci)>=n) 
then it follows trivially that it is equivalent to 
(XA 8€Bi O Aj 8C;)>=n) 
in order to guarantee the distributive property between the matrix operation and the xor. 


The matrix product consists of calculating the scalar product of row and column and returning 1 
1f the result is greater than or equal to n; n is the number of clauses. 


When the matrix multiplied in this way with itself gives the same matrix we will say that 1t has 


reached a Noetherian ideal, and if it maintains the main diagonal then we will proceed to find a k- 
th solution. To find the k-th solution we only have to choose a pair of clauses whose submatrix 
has more than one 1 and we select only one 1, then we calculate its noetherian ideal and if the 
result is O then it was inconsistent, if on the contrary we repeat the procedure until the matrix 
reflects a single minterm. 

For the next fórmula 


k=m 
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TO (x +y +) (Fx + y + z) (Fx +y +) 
(x 1 0 0 0 1 1 0 0 1 
+ y 0 1 0 1 0 1 0 1 0 
+7) 0 0 1 1 1 0 1 0 0 
Ex 0 1 1 1 0 0 1 0 0 
+>y 1 0 1 0 1 0 0 0 0 
+z) 1 1 0 0 0 1 0 0 1 
(Ex 0 0 1 1 0 0 1 0 0 
Eb 0 1 0 0 0 0 0 1 0 
+2) 1 0 0 0 0 1 0 0 1 
TRAIT Tp t+Tm Toy +Ti T9]=3 

0 1 1 1 0 0O|f0 1 1 0 1 1|/1 0 0 0 0 11|/1 0 O 
1 0 10 1 ON1 O 1|+11 O 1|O 1 OI+[O0 1 OJO O O]|[>3 
11.0 0 0 1/11 1 0 11 0/10 O 1 10 0/10 O 1 

O 1 1 0 1 1 O 1 1 0 0 1 

1. 0 1ANÍ1 O 1|+[1 O 1|+0 O O||[=3 

1.1.0 11.0 11.0 100 

O 1 1 0.0 1 0 0 1 

1.0 1|¡A0 O O|=[0 O O|=7xXATIZVxAZ 

11.0 10.0 100 


What this algorithm should induce us is that it is still possible to find methods that on the most 
powerful machines can give us more efficient results than the quadratic one. 
It is easy to see this algorithm will work because this little lemma: 


4/(4,-1)=0>[ | 4,=u(-n+), 4) 


where u(x)=1 iff x is not negative, else 0. Considering a matrix is formed by subtables, each table 
with only one solution is enough to be a sufficient condition to ensure the satisfiability of the 
complete formula under a Noetherian ideal because every solution will get catched and: 


Table 5. After multiplying there is no incoherence added. 














SAT(f) > Noetherian(f) 
l 1 1 
0 1 1 
0 1 0 














So after finding a solution if there is noone f is unsatisfiable. 


On the other hand, as can be seen in annex 2, if a solution is added to an ideal that is inconsistent 
and not 0, then its ideal will be 0. Therefore, once the ideal is found, either there is a solution or 
there is no solution, but there cannot be a solution mixed with an ideal without a solution - so 
there is no backtracking. 


3. Conclusions 

This paper has presented two independent methods that solve SAT in polynomial time. The aim 
of the first method is to simplify as much as possible the formalisation of the problem until 1t is 
solved, in order to transform it into modular linear equations. The aim was to be able to perform a 
triangulation using a more generic method than the Gaussian method. 


This first method could be a way of continuing another chapter in the history of modular 
equations [3], just as the second method may well be the most practical way of building on the 
work of Noether [4], to invite more expert mathematicians to design their own rings by 
improving on method 2. They only need to be designed in such a way that when defining the sum 
and the product, the consistency of the results can be demonstrated and their ability not to 
incorporate valid solutions to an initially solutionless formula. 


Thanks to these results, it is easy to review how to restate the problems exposed in [5] using 
method 1. Also review the open problems in [6] with method 2 in order to design better rings to 
manage parallel processors. 
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Appendices 


Anmnex 1. Demonstration Ax(B+C)=AxB+AxC 


Lemma 1. Distributive relation between A ande 







































































X A Y o E A Z o xX A (Y O Z) 
0 0 0 0 0 0 0 1 0 0 0 0 0 
0 0 0 0 0 0 1 1 0 0 0 1 1 
0 0 1 0 0 0 0 1 0 0 1 1 0 
0 0 1 0 0 0 1 1 0 0 1 0 1 
1 0 0 0 1 0 0 1 1 0 0 0 0 
1 0 0 1 1 1 1 1 1 1 0 1 1 
1 1 1 1 1 0 0 1 1 1 1 1 0 
1 1 1 0 1 1 1 1 1 0 1 0 1 
Tf, to form a ring, we have an intermediate operator O that satisfies: 
1) the product A x B has the form AA(A OB), 
2) the sum A +B has the form A OB 
3) anditis satisfied that: AS(BOC)=ABBO ABC, 
then: 
Ax(B+C)=AxB+AxC 
Dem. 
In the process the expression will be changed to an always equivalent expression 
Ax(B+C) 
AA(AO(B OC) 
AA(AGOBOGASC) 
AA(AGOB)OAAM(AONSC) [By Lemma 1] 














AxB+AxC 





Anmnex 2. Given two non-zero ideals which contains identity whose conjunction has 
as ideal 0, their adhesion has as ideal 0. 


Given A and B non-zero ideals which contains identity, the adhesion is 
AVB=A OBOBAAB=A + B+AAB 


So let us study the ideal of an expression of the form (A + B + e) with A and B ideals and ideal of 
e is 0. The ideal of (A + B + e) shall take the form of (Aa + BP + ey) and the relationship will be 
expressed in this way: 


(A+B+8) >(A0.+BB+ey) eq 1 


Being an ideal means that (A + B + e) (Aa + Bf + ey) = (Aa + BP + ey) 





Likewise: AA = A, BB = B; therefore, 
(A+B +8) (Au. + BB +ey) = (Au. + BB+ey) 
(A+B+1) (Ao. +BB)= e (ey + Au + BB +7) 








Considering e >0 from the statement, and since the ideal of (A+B+1) is a case ofe=AAB=1, 
1t follows that Aa + Bf =0. So, 
(A+B+Ee)>(Aa + BB + ey) > ey >0 


So, if we study the other possibilitry: (A+B +1) —>(Aa+BB + ey)=Z 

















(A+B+1)Z=Z 
(A+B)Z=0 
Z=0 
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